Explaining Model Confidence Using Counterfactuals

T Le; T Miller; R Singh; L Sonenberg

Conference Proceedings

Explaining Model Confidence Using Counterfactuals

T Le, T Miller, R Singh, L Sonenberg

Proceedings of the 37th Aaai Conference on Artificial Intelligence Aaai 2023 | ASSOC ADVANCEMENT ARTIFICIAL INTELLIGENCE | Published : 2023

DOI: 10.1609/aaai.v37i10.2636126399

Open access

Abstract

Displaying confidence scores in human-AI interaction has been shown to help build trust between humans and AI systems. However, most existing research uses only the confidence score as a form of communication. As confidence scores are just another model output, users may want to understand why the algorithm is confident to determine whether to accept the confidence score. In this paper, we show that counterfactual explanations of confidence scores help study participants to better understand and better trust a machine learning model’s prediction. We present two methods for understanding model confidence using counterfactual explanation: (1) based on counterfactual examples; and (2) based on ..

View full abstract

University of Melbourne Researchers

Liz Sonenberg Author

Related Projects (1)

Explanation in Artificial Intelligence: A Human-Centred Approach

This project aims to produce validated methods for creating human-centred explanations of decisions made by artificial intelligence (AI). Tr..

Grants

Awarded by Australian Research Council

Funding Acknowledgements

This research was supported by the University of Melbourne Research Scholarship (MRS) and by Australian Research Council (ARC) Discovery Grant DP190103414: Explanation in Artificial Intelligence: A Human-Centred Approach.